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I  EXECUTIVE  SUMMARY 


In  this  report,  we  examine  the  formalism  of  coupled  mode  theory  from  several  viewpoints,  using  the  fiber 
optic  directional  coupler  as  a  model  problem  to  unify  the  discussion.  With  this  model,  we  present  the  basic 
maps  and  tools  used  in  describing  coupled  systems,  emphasizing  the  connections  between  the  approaches  by 
using  a  consistent  notation  across  phasor,  matrix  algebra,  and  geometrical  descriptions.  Novel  developments 
include: 

•  a  space/time  phasor  description  of  the  fields  in  coupler  propagation  problems, 

•  a  derivation  of  the  “gauge  transformation”  taking  coupled  wave  devices  to  the  standard  coupled  mode 
formulation, 

•  a  phasor  description  of  the  coupled  state  map  using  the  conventional  basis  set, 

•  a  technical  discussion  of  the  phase  ambiguity  in  the  geometric  viewpoint,  and  a  novel  approach  to 
using  branch  cuts  and  sheets  from  complex  variable  theory  to  resolve  it, 

•  a  careful  description  of  the  co-ordinate  change  to  the  coupled  basis  set, 

•  an  analysis  using  similarity  transforms  to  solve  the  coupled  mode  problem  directly  in  the  matrix 
description 

•  a  description  of  the  connection  between  the  geometrical  approach  and  the  conventional  dispersion 
maps  in  coupled  wave  problems. 


E-l 
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II  INTRODUCTION 
Scope 

This  report  is  meant  to  synthesize  several  views  of  coupled  mode  theory,  with  an  eye  towards  making 
connections  between  the  geometrical  approach  and  various  analytic  techniques.  By  viewing  a  standard 
problem,  the  directional  coupler,  we  illustrate  how  the  phasor,  matrix,  and  geometrical  views  complement 
each  other.  It  is  not  intended  to  be  an  exhaustive  survey  or  an  ah  initio  tutorial,  but  rather  a  reference  for 
fiber-optic  practitioners. 

By  using  the  directional  coupler  as  our  model  problem,  we  show  that  the  same  formalism  we  have 
developed  earlier  for  generalized  coupled  modes  [1]  and  for  propagation  in  birefringent  networks  [2,  3] 
can  be  applied  directly.  We  make  the  connection  between  the  2x2  matrix  approach  which  is  standard  for 
the  coupler  (“Jones  calculus”  in  optics)  and  the  3D  vector  description  of  the  same  problem,  weaving  them 
together  with  phasor  descriptions  of  the  phase  relationships.  This  3D  geometric  description  is  the  Poincare 
sphere  in  classical  optics,  and  was  generalized  to  incoherent  light  as  the  Stokes  and  Mueller  approach  [4], 
The  Stokes  description  reduces  to  the  Poincare  sphere  description  when  one  treats  monochromatic  light,  and 
we  restrict  ourselves  to  this  case. 

Background 

There  arc  several  potential  pitfalls  in  analyzing  systems  with  propagating  waves.  One  is  the  fact  that 
the  sign  convention  can  be  counterintuitive  when  switching  back  and  forth  from  space  and  time  views.  This 
is  partly  due  to  the  symmetry  of  sine  wave:  since  reversing  the  sign  of  the  argument  leaves  a  sine  wave, 
the  function  looks  the  same  in  both  2  and  t,  which  have  opposite  signs.  Another  pitfall  is  in  tracking  the 
phase  of  waves  with  different  propagation  constants:  larger  propagation  constants  mean  that  more  phase 
accumulates  per  unit  length,  and  thus  that  the  wave  travels  more  slowly.  Thus,  “advancing”  in  phase  means 
that  the  wave  is  travelling  slower.  It  is  important  to  recognize  that  the  sign  of  the  time  and  space  arguments 
changes  the  form  of  the  description,  so  that  reference  formulae  must  be  checked  from  different  sources.  We 
will  alert  the  reader  to  these  as  we  go. 

We  will  discuss  guided  travelling  waves,  cast  as  plane  waves  of  the  form 

g(z,t)  =  f(z-vt)  =  Ae^t~kz\  (1) 

namely  a  sinusoidal  wave  travelling  in  the  positive  2  direction  at  phase  velocity  v  =  u/k,  with  angular 
frequency  u  and  propagation  constant  k  =  nko  =  27rn/Ao,  where  n  is  the  phase  velocity  index  of  refraction. 
The  zero  subscripts  imply  the  propagation  constant  and  wavelength  in  vacuum. 

Our  model  system  is  a  directional  coupler  comprising  a  pair  of  waveguides  that  are  coupled  evanescently 
on  a  substrate.  A  standard  application  is  that  light  is  introduced  at  one  of  the  input  ports  and  evolves  to  light 
emerging  at  the  two  output  ports  with  equal  amplitude. 

Phasor  description 

To  help  ground  the  discussion,  we  look  at  the  uncoupled  case:  light  waves  of  equal  amplitude  are  in¬ 
troduced  into  two  waveguides  which  have  slightly  different  indices  of  refraction.  The  situation  is  illustrated 
below  in  Fig.  1.  Light  of  equal  amplitude  and  phase  is  introduced  into  input  faces  at  each  of  the  two  wave¬ 
guides  (right.  Fig.  la).  The  mode  amplitudes  arc  sketched  along  the  transverse  dimension  (axis  pointing  to 
the  right  and  down),  while  the  two  other  axes  arc  meant  to  represent  the  field’s  phasor  in  the  complex  plane. 
The  “Re  —  Im ”  axes  define  that  plane,  properly  viewed  from  a  vantage  point  up  and  to  the  left  of  the  figure 
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(shown  schematically  as  an  ’eye’).  The  phasors,  representing  the  optical  phase  rotate  CCW  in  time, 
and  at  successively  later  times  (snapshots  progressing  to  the  left)  they  have  successively  greater  angles  with 
respect  to  the  real  axis.  This  sequence  represents  the  field  at  the  input  face  with  distance  fixed  at  z  =  0. 
Note  that  we  could  also  view  this  series  of  time  snapshots  as  a  sequence  of  phasors  distributed  in  space, 
rigidly  translating  as  a  wave  towards  the  input  face.  Again,  viewed  properly  from  a  vantage  point  up  and  to 
the  left  of  the  figures,  the  phasors  can  be  viewed  as  varying  in  space  as  e~^kz,  and  would  now  appeal-  to  be 
a  sequence  rotating  in  the  opposite  direction,  namely  CW,  as  2  increases  to  the  left  for  the  observer. 

Moving  to  the  situation  inside  the  waveguides  (Fig.  lb),  we  assume  waveguide  1  (shaded  waveguide) 
has  an  index  greater  than  that  of  waveguide  2.  We  expect  the  sequence  of  phasors  to  also  rotate  CW,  e~jkz 
as  above,  when  we  fix  time  and  vary  space  by  going  deeper  into  the  waveguide.  In  both  waveguides,  the 
phasors  rotate  CW,  but  since  waveguide  1  has  a  greater  k,  its  phasor  rotates  more  rapidly  as  we  progress  to 
greater  z.  Thus,  while  both  guides  have  the  same  phase  at  the  input  face,  guide  l’s  phasor  has  gone  through 
a  greater  angle  at  each  position.  Fixing  attention  at  the  output,  one  sees  the  slightly  confusing  situation  that 
guide  1  has  advanced  more  in  phase.  If  we  now  let  time  progress,  both  phasors  rotate  at  the  same  e]uji,  and 
the  light  in  guide  2  will  go  through  the  real  axis  first.  This  means,  for  instance,  that  guide  2’s  maximum  will 
occur  at  an  earlier  time  than  guide  1  ’s  maximum  because  it  has  fewer  radians  to  move.  This  is  the  indication 
that  the  same  point  on  a  waveform  (i.e.  the  top  of  a  cosine  cycle)  happens  earlier  for  waveguide  2:  the  light 
in  waveguide  2  is  travelling  faster  than  the  light  in  waveguide  1 . 

Overview 

Given  this  basic  phasor  approach,  we  will  be  able  to  treat  the  phase  evolutions  of  the  system  consistently. 
In  the  next  two  sections,  we  present  descriptions  of  the  coupled  mode  formalism  in  both  the  standard  matrix 
algebra  approach  (culminating  in  the  basic  coupled  mode  equation,  Eqn  (9))  and  in  the  geometric  repre¬ 
sentation  we  have  used  earlier.  A  new  feature  of  this  development  is  an  explanation  of  the  ways  in  which 
the  periodicity  of  the  state  map  can  be  interpreted  using  ideas  from  complex  variables  (i.e.  branch  cuts  and 
branch  points).  A  general  map  is  developed  (Fig.  3)  which  exploits  phasor  representations.  In  Section  V,  we 
exercise  the  formalism  for  uncoupled  waveguides,  and  introduce  the  notion  that  the  physical  character  of  the 
system  is  largely  a  matter  of  the  viewpoint  enforced  by  choosing  a  basis.  Since  our  development  does  not 
describe  coupling  in  the  conventional  sense,  it  illustrates  the  “basis  prejudice”  common  in  descriptions  of 
physical  systems.  We  conclude  that  Section  by  comparing  the  “maps”  of  the  different  basis  sets,  illustrating 
the  fact  that  since  phase  is  an  important  concept  in  two-state  systems,  one  must  be  careful  when  making 
transformations  between  bases.  Finally,  in  Section  VI,  we  analyze  the  directional  coupler  itself.  Instead  of 
a  formal  mathematical  development  as  in  earlier  work  [1],  we  approach  the  problem  from  the  matrix  de¬ 
scription,  showing  that  the  state  evolution  can  be  viewed  as  a  similarity  transform  of  the  simplest  uncoupled 
description.  We  conclude  the  Section  with  a  discussion  of  the  connection  between  directional  couplers  and 
the  dispersion  curves  for  the  waveguides. 

Ill  COUPLED  STATES:  MATRIX  FORM 
Basic  coupled  mode  equation 

In  our  basic  model,  we  assume  a  pair  of  waveguides  evanescently  coupled  on  a  substrate.  The  amplitudes 
of  the  fields  in  the  two  waveguides  can  be  cast  as  a  2D  column  vector,  each  entry  corresponding  to  the 
amplitude  of  the  wave  in  that  waveguide.  The  formalisms  for  describing  propagating  waves  that  are  coupled 
in  space  such  as  these  was  introduced  in  1954  by  Pierce  [5]  as  a  matrix  representing  a  linear  transfer  function, 
and  was  applied  in  differential  from  by  Yariv  [6]  to  describe  coupling  in  electro-optic  devices  in  an  influential 
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Fig.  1:  Evolution  in  uncoupled  guides  (see  text),  (a)  Phase  of  light  at  input  face  for  a  series  of  times 
(progressing  to  the  left).  Time  dependence  for  both  guides  are  identical  (~  e]lJjL)  and  amplitudes  are  fixed, 
(b)  Phases  in  waveguides  for  fixed  time  t  =  0.  Both  phasors  rotate  CW  in  space,  and  mode  1  has  higher 
index  than  mode  2. 

article.  A  shortcoming  of  that  article  is  its  formal  description  of  the  coupling  equation.  A  more  physical 
description  is  given  by  Haus  [7],  who  presents  a  general  development  with  matrix  elements  that  are  tied  to 
the  system  parameters.  In  this  development,  the  complex  amplitudes  for  two  waves,  a\  and  d2,  are  described 
by  a  coupled  differential  equation  with  physically  descriptive  matrix  elements.  (We  use  barred  coefficients 
because  we  will  deal  with  a  transformed  version  of  them  later.)  The  coupling  equations  can  be  cast  in  matrix 
form  as  [7] 


d_ 

dz 


at 

-jki 

K12 

ai 

“2 

«21 

jkr2 

d2 

Physically,  this  equation  means  that 


(2) 


•  We  consider  only  the  two  modes,  implicitly  assuming  that  both  describe  waves  propagating  in  the 
positive  z  direction  at  optical  frequency  uj,  which  is  not  explicitly  expressed. 


•  In  the  absence  of  coupling  (off-diagonal  terms  Ktj  are  zero),  each  wave’s  amplitude,  a,;,  advances 
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in  phase  as  ai(z)  ~  aioe  >kiZ.  These  kr  arc  the  wave  numbers,  or  propagation  constants,  for  the 
uncoupled  modes. 

•  Each  wave’s  amplitude  is  modified  by  coupling  to  the  other  wave:  k%3  describes  the  changes  that 
amplitude  a i  experiences  due  to  the  presence  of  amplitude  aj  in  the  adjacent  guide. 

•  The  source  of  this  coupling  is  that  each  wave’s  field  creates  a  polarization  vector,  P,  in  the  other 
wave’s  waveguide,  and  this  polarization  is  a  driving  term  for  the  wave  equation. 

•  If  power  is  to  be  conserved[5,  6,  7],  we  must  have  n \ 2  =  —  This  makes  the  matrix  in  Eqn  (2)  a 
Hermitian  matrix  multiplied  by  —j,  preventing  the  compound  state  from  growing  in  amplitude. 

This  form  is  the  starting  point  for  all  discussions  of  coupled  mode  systems. 


Coupling  constant  k 

It  is  the  presence  of  the  off-diagonal  coupling  constant  k  that  permits  the  wave  in  one  mode  to  influence 
the  other.  Yariv  [6]  details  several  basic  forms  for  this,  but  a  simplified  form  shown  by  Haus  for  a  slab 
waveguide  model  [7]  is  the  overlap  integral  between  the  modes, 


Kij 


-ju 

4 


(3) 


where  em  is  the  dielectric  constant  inside  the  waveguide  and  e  is  that  for  the  rest  of  the  substrate.  The 
integration  takes  place  over  the  transverse  dimensions  of  waveguide  j,  namely  Aj,  and  the  transverse  fields 
for  the  modes  are  given  by  the  e.  For  identical  waveguides,  the  coupling  constant  in  Eqn  (3)  is  seen  to  be 
purely  imaginary:  k  1 2  =  k2i  =  —  j  k.  Later,  we  will  take  this  case  as  our  baseline  coupled  system. 

In  general,  for  power-conserving  systems,  we  can  cast  these  coupling  coefficients  (looking  ahead  for 
later  convenience)  in  terms  of  their  real  and  imaginary  parts  as 


and 

so  that 

Note  that 


« 12  =  Kre-jKim  =  ~  j («im  ~  j (~ «re))  =  ~j(Kl  ~  JK2) 

K:2 1  —  Kre  j  Kim  —  j  ( ^irn  T  j  (  ^re))  —  j{^l  T  j  ^2 ) 

Hi  =  Kim.  =  -Im(Ku)  k2  =  -Kre  =  -Re(n  12)  =  Re(K  21). 

I  1 2  1  1 2  2,2 

|«12 1  —  \K2i\  ~  K1  +  k2 


(4) 


Removing  the  average  phase  from  the  coupled  equation 

The  propagation  constants  k\  and  k2  are  identical  in  the  ideal  coupler,  but  in  actuality  may  not  be 
identical.  It  useful  to  separate  the  average  optical  phase  that  both  waveguides  experience  from  the  phase  dif¬ 
ference  due  to  differing  propagation  constants.  We  decompose  the  propagation  constants  into  their  average 
and  difference  components,  choosing  k\  as  the  “slow”  wave  (greater  propagation  constant)  : 

,  _  j  .  ,  _T  ^  7  h  +  h  k\  —  k2 

ki  =  k  +  k3  ;  k2  =  k-  k  3  k  =  — - —  ;  K3  =  — x —  (5) 
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This  reflects  the  fact  that  both  guides  have  an  underlying  average  optical  propagation  constant  (k,  which 
might  vary  along  the  waveguide)  but  that  the  two  guides  may  differ  slightly  (in  this  case  by  2ks).  Then  the 
coupled  mode  equation  can  be  cast  as 


d 

at 

k 

+ 

K\ 

-  JK  2 

at 

dz 

d'2 

=  ~j 

«i 

+  jK  2 

k 

-  «3 

0.2  _ 

- 

«1 

-  JK  2 

at 

=  ~j 

+  jK  2 

-  K  3 

0.2 

at 
a  2 


(6) 


where  I  is  the  2x2  identity  matrix. 

The  overall  phase  due  to  k  in  Eqn  (6)  can  be  transformed  away  [1]  by  a  “gauge  transformation”  in  which 
the  original  wave  amplitudes,  dt,  are  replaced  by  the  a,;,  their  “slowly  varying”  components: 


ai(z)  -A  e-iS~k^dzai{z). 


(V) 


Under  this  transformation,  the  derivatives  of  the  original  wave  amplitudes  become 

^  =  -jk(z)e-^~k^dzai(z)  +e-jfHz)dz^i  =  -jje(z)a.(z)  +  e-ifk(z)dz^'  (8) 

Substituting  Eqn  (8)  into  the  left  side  of  Eqn  (6),  and  substituting  Eqn  (7)  for  the  transformed  amplitudes 
into  the  right  side  of  Eqn  (6)  yields 


d 

r 

- 

_ 

r 

a  i 

«  3 

-  jn 2 

at 

dz 

02 

=  ~j 

Kl  +  jK 2 

~  K  3 

02 

The  implication  of  Eqn  (9)  is  that  the  overall  phase  (attributed  to  the  optical  path  length)  can  be  trans¬ 
formed  away  from  the  differential  equation.  That  is,  one  can  solve  Eqn  (9)  for  the  slowly  varying  amplitudes 
and  then  simply  multiply  by  the  overall  phase,  e~ ^  =  e_J  f  k(z)dzt  at  the  end  of  the  problem  to  regain  the 
original  “correct”  barred  amplitudes1.  This  separability  of  the  overall  phase  has  been  exploited  in  the  geo¬ 
metrical  representation  for  problems  that  require  total  phases[2,  3],  such  as  interferometers. 

Eqn  (9)  is  the  basic  equation  of  coupled  mode  theory  for  power  conserving  systems  in  which  the  overall 
phase  is  suppressed.  Recall  that  for  identical  coupled  waveguides,  Eqn  (4)  implies  that  only  K\  /  0. 


IV  COUPLED  STATES:  GEOMETRIC  FORM 


Normal  form 


The  wave  amplitudes  ai  and  a  2  arc  the  complex  wave  amplitudes  for  the  two  guides,  which  we  can 
consider  as  the  components  of  the  total  state  of  the  coupler,  |  'iptot)-  This  state  represents  everything  we  know 
about  the  system.  But  we  are  applying  two  constraints  by  (1)  ignoring  the  overall  phase,  and  (2)  requiring 
power  conservation.  Thus,  a  “reduced”  state  of  the  system  can  be  described  with  two  degrees  of  freedom 
(DOF),  and  these  can  be  represented  by  the  latitude  and  longitude  of  a  point  on  a  unit  sphere.  In  particular, 
we  can  simplify  the  total  state  to  this  reduced  state  as 


| iptot)  ~  {pe  j4>} 


cos(6»/2)e"^/2 

sin(#/2)e+J<V2 


cos(6/2)e 

sin(0/2)e+J'*/2 


(10) 


1  There  is  no  mathematical  adiabatic  restriction  to  “slowly  varying”  functions,  but  the  unidirectional  Eqn  (9)  requires  no  reflected 
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We  call  Eqn  (10)  the  “normal  form”  and  it  has  several  implicit  assumptions  and  conventions: 

•  We  introduce  the  description  of  our  system  as  an  abstract  vector,  jk>).  This  vector  exists  independently 
of  the  basis  set  in  which  it  is  described. 

•  The  symbol  is  used  to  distinguish  between  the  abstract  state  vector  and  the  two  dimensional 
column  which  represents  the  co-ordinates  of  the  state  vector  in  the  assumed  basis  set.  In  other  words, 
“the  map  is  not  the  territory:”  a  vector  is  not  a  set  of  coordinates.  At  the  moment,  we  arc  describing  it 
in  the  bases  corresponding  to  unit  amplitudes  in  the  two  waveguides. 

•  The  terms  in  braces  arc  the  two  DOF  that  we  have  suppressed,  namely  an  overall  magnitude  for  the 
vector,  p  (assumed  unity  for  power  conservation),  and  the  overall  phase  (p  =  (  k(z)dz  (which  is 
ignored,  since  it  can  be  reclaimed  at  the  end).  The  seemingly  redundant  factors  of  2  arc  to  make  the 
geometric  representation  more  easily  used. 

•  The  angle  9  is  the  polar  angle  in  the  geometrical  map.  Its  range  is  0  <  9  <  tt.  Thus,  9  =  0  corresponds 
to  unit  magnitude  in  waveguide  1,  the  top  co-ordinate,  with  zero  magnitude  in  the  other  waveguide, 
while  6  =  n  is  the  reverse. 

•  This  convention  means  that  the  “north  pole”  ( 6  =  0)  and  the  “south  pole”  ( 6  =  n)  correspond  to  the 
two  basis  vectors  for  the  states  (namely,  unit  amplitude  in  each  of  the  waveguides),  and  thus  that  the 
polar  angle  is  a  measure  of  the  relative  amplitudes  of  the  waves  in  those  waveguides. 

•  The  angle  <j)  is  the  phase  angle  between  the  two  waveguide  amplitudes,  its  range  is  —  tt  <  <p  <  +7 r 
to  map  onto  the  azimuthal  angle  in  the  geometrical  map.  That  is,  the  azimuthal  angle  of  the  state’s 
vector  is  the  phase  difference  (modulo  27 r)  between  the  waveguide  components. 

•  Once  the  magnitude  and  the  overall  phase  are  stripped  away,  the  arrow  indicates  that  only  the  two 
angles,  9  and  <b.  are  needed  to  represent  the  relative  amplitudes  and  phases  of  the  components,  the  two 
DOF  we  consider. 

•  We  have  used  the  “positive  frequency”  convention  for  time,  and  thus  a  wave  propagating  towards 
more  positive  z  coordinates  will  change  phase  as  ~  e~jkz.  This  takes  a  bit  of  getting  used  to.  One 
way  of  viewing  it  is  to  visualize  a  phasor  representation  as  in  the  Introduction:  as  time  progresses, 
the  phasor  representing  the  wave  will  rotate  CCW  in  the  “positive”  direction.  To  follow  a  given  point 
on  the  waveform,  one  would  have  to  advance  in  z  to  keep  up  with  the  wave,  “unwinding”  the  phase 
accrued  over  time  by  rotating  in  the  “negative”  direction  (CW)  as  one  proceeds  down  the  z  axis. 


Mapping  normal  form  to  3D  representation 

The  utility  of  the  geometric  representation  is  that  it  leads  to  a  coordinate-free  description:  a  state  is  then 
represented  by  a  point  on  the  sphere,  and  we  can  often  describe  its  trajectory  by  using  geometrical  means, 
such  as  an  equation  of  motion  (described  later).  Thus,  we  want  to  be  able  to  switch  back  and  forth  between 
the  2D  normal  form  vector  and  the  3D  representation  with  ease.  To  do  this,  we  need  a  “map”  as  described 
below.  We  assume  an  orthogonal  basis  set  for  the  3D  representation,  namely  {ei,  §2,  e:j } . 

We  associate  two  basis  state  vectors,  which  correspond  to  the  standard  basis  vectors  in  2D, 


1)  ~ 


1 

0 


1 2)  ~ 


0 

1  ’ 
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and  represent  them  in  the  geometrical  representation  as  the  north  and  south  poles  of  a  unit  sphere,  namely 
unit  vectors  ±63.  Then  any  state  expressed  in  the  normal  form  has  a  unique  latitude  and  longitude,  i.e.  a 
unique  9  and  4>,  and  thus  a  unique  position  on  the  surface  of  the  unit  sphere,  as  shown  below. 


Fig.  2:  Geometric  representation  of  states.  The  ‘top’  state,  |1)  is  at  the  north  pole  ( 8  =  0),  corresponding 
to  unit  amplitude  in  waveguide  1.  General  state  |i />)  is  represented  by  polar  angle  8  and  azimuthal  angle  0, 
which  represent  relative  amplitudes  and  phases  between  fields  in  the  waveguides,  respectively. 


is  called  the  “even”  state  since  it  is 
the  two  basis  states.  Thus,  in  the  3D 


A  general  state  is  shown  in  Fig.  2,  which  highlights  three  of  the  principal  states  often  used  in  describing 
the  system  behavior.  The  “north  pole,”  or  <33  is  state  1 1),  the  “top”  basis  vector  (i.e.  the  field  is  only  in 
waveguide  1).  In  light  of  the  normal  form,  Eqn  (10),  it  must  be  that  8  =  0,  and,  since  no  light  is  in  the  other 

state,  the  phase  (j)  is  indeterminate.  The  state  given  by  |e)  =  j 

the  symmetric  (equal  amplitude,  zero  phase  difference)  supeiposition  o 
representation,  it  defines  the  axis  for  which  the  azimuthal  angle  is  zero.  It  corresponds  to  ( 8  =  tt/2.  cj)  =  0), 
which  is  defined  as  ei  .  The  third  labelled  state,  |+),  is  the  “phase  advanced”  state.  Since  9  =  <f>  =  7t/2  for 
this  state,  it  corresponds  to  £2  in  the  3D  representation,  and  in  the  standard  basis  it  is  represented  in  normal 

r  e-j(n/ 2)/2  1 

form  as  +j(n/2)/2  '■  the  phase  of  the  component  in  state  |1)  is  7r/4  more  advanced  than  the  average 

phase,  while  the  component  in  orthogonal  state  1 2)  is  7t/4  behind  the  average  phase.  Thus  the  top  component 
(guide  1)  is  advanced  by  7t/2  with  respect  to  the  bottom  component  (guide  2).  From  the  viewpoint  of  1 1) , 
state  |+)  has  been  rotated  in  a  positive  sense  (right  hand  rule)  from  | e) ,  the  zero-phase  state,  while  from  the 
viewpoint  of  state  1 2) ,  state  |+)  has  been  rotated  in  a  negative  sense  from  |e).  Physically,  this  means  that 
for  state  |+),  its  component  from  |1)  has  acquired  more  phase  (e~?5r/4),  while  its  component  from  |2)  has 
lost  phase,  (e+J7r/4).  That  is,  the  component  in  |1)  is  the  “slower”  of  the  two  components:  it  has  a  larger 
propagation  constant,  and  acquire  more  phase  every  meter  than  does  the  component  in  1 2). 

Note  that  in  the  geometric  representation,  both  \e)  and  |+)  are  at  an  angle  of  7r/2  from  the  basis  vector 
1 1) ,  but  their  amplitudes  (see  Eqn  (10))  in  that  basis  state  are  the  cosine  of  half  that  angle,  namely  cos  ^  = 
This  illustrates  the  general  point  that  angles  in  the  geometrical  representation  are  twice  the  “real” 
angles:  two  states  that  are  orthogonal  must  be  on  opposite  sides  of  the  sphere.  Thus  the  states  orthogonal 
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to  1 1),  |e),  and  |+),  arc  |2),  |o),  and  |— ),  (bottom  basis,  odd  (or  anti-symmetric),  and  phase  retarded  states, 
respectively)  and  are  located  on  the  corresponding  opposite  sides  of  the  sphere. 

In  light  of  the  above,  we  see  that  the  state  shown  in  Fig.  2  has  most  of  its  power  in  waveguide  1,  and 
the  phase  of  the  light  in  waveguide  1  is  phase  advanced  with  respect  to  the  light  in  waveguide  2  by  cj>.  An 
important  point  to  note  from  Eqn  (10)  is  the  “cosine  half-angle  projection”  property,  i.e.  that  the  amplitude 
of  a  unit  state,  projected  onto  another,  is  given  by  the  cosine  of  half  the  angle  between  their  vectors.  That  is, 
for  instance,  the  amplitude  of  \i/t)  in  Fig.  2,  projected  onto  state  1 1)  is  cos  |,  from  the  normal  form,  while  its 
amplitude  projected  onto  state  1 2)  is  cos  =  sin  § :  the  left  side  of  this  relation  is  from  the  geometrical 
construction  (since  the  angle  from  \ip)  to  \2)  is  tt  —  6),  while  the  right  side  is  from  the  normal  form,  Eqn 
(10). 

The  above  property,  the  cosine  half-angle  projection,  is  true  between  any  two  states  generally:  by  suit¬ 
able  co-ordinate  transformation,  one  can  always  move  one  of  the  states  to  1 1)  and  then  project  onto  it  as  in 
Eqn  (10).  This  leads  to  a  real  advantage  for  the  geometrical  representation:  it  permits  a  direct  visualization 
of  changes  in  basis  states.  Consider  the  held  shown  in  Fig.  2  as  state  | '(/’).  In  the  basis  we  have  chosen,  it 
represents  the  labelled  distinct  values  of  relative  amplitudes  for  the  two  guides,  and  relative  phases  between 
the  fields.  If  we  were  to  change  the  basis,  for  instance,  so  that  the  even  and  odd  superposition  states  |e)  and 
|o)  were  used  as  basis  states,  the  actual  in  the  guides  would  be  the  same,  and  they  would  correspond  to 
that  same  point  on  the  unit  sphere  (and  thus  the  same  state  | '(/’))  but  the  coordinates  of  that  point,  its  latitude 
and  longitude  with  respect  to  |e)  and  |o),  would  be  different.  Properly  viewed,  this  is  a  liberating  viewpoint 
to  adopt:  any  two  antipodal  points  can  become  a  basis  set,  and  we  are  free  to  choose  them  as  a  basis  to  suit 
our  convenience.  Thus,  a  given  trajectory  for  a  state  can  be  analyzed  instantly  in  terms  of  other  physical 
bases.  In  the  next  few  sections,  we  will  illustrate  several  views  of  the  basic  coupler  model. 

State  map:  technical  details 

To  flesh  out  this  map,  the  phasor  characteristics  of  the  states  are  shown  in  Fig.  3,  below,  using  the  same 
hybrid  phasor  notation  we  used  in  Fig.  1.  For  instance,  state  \e)  is  seen  to  be  the  even  combination  of  the 
two  basis  states,  |1)  and  |2),  while  state  |+)  is  the  state  for  which  the  phase  difference  between  the  bases  is 
7t/2  (i.e.  the  bases  are  advanced  and  retarded  by  7r/4,  respectively,  as  in  the  normal  form,  Eqn  (10)). 

While  the  phase  difference  between  components  in  the  two  bases  can  grow  without  bound,  the  map  can 
only  display  one  transit  of  the  sphere,  27r  radians.  This  leads  to  a  technical  issue  which  we  address  here. 
Consider  how  states  with  increasing  phase  difference  in  Eqn  (10)  would  be  represented:  we  imagine  starting 
with  state  |e),  changing  only  the  relative  phase  between  the  components,  and  observing  how  the  states  are 
mapped.  State  \e)  has  zero  phase  difference  and  as  one  increases  the  phase  difference,  the  state  would  evolve 
along  the  equator,  reaching  state  |+)  when  the  phase  difference  reaches  7r/2  (i.e.  ±7t/4),  and  finally  arriving 
at  the  state  labelled  |o+)  as  the  phase  approaches  tt  (Fig.  3  ).  On  the  other  hand,  if  we  started  at  state  \e) 
and  decreased  the  phase  difference,  the  state  would  first  evolve  to  |  — )  and  then  approach  the  state  labelled 
|o_)  as  the  phase  difference  approaches  —  tt.  It  seems  as  if  state  |o)  has  two  different  representations:  |o+) 
and  |o_)  have  distinctly  different  shapes  in  Fig.  3,  depending  on  the  path  taken  from  state  |e). 

What  is  going  on  is  that  the  sphere  is  a  “cover”  for  the  phases:  it  can  only  represent  phases  modulo  27t. 
This  is  shown  in  Fig.  4.  Technically,  we  can  view  the  sphere  as  a  “sheet”  covering  phases  —  tt  <  (/>  <  7r, 
and  as  the  phase  increases  past  state  |o+)  (i.e.  on  its  way  from  |e)  through  |+)  to  |o+)  and  then  beyond 
|o_|_)),  it  passes  over  a  “branch  cut”  at  <fr  =  tt  onto  another  sheet,  this  one  representing  tt  <  cj)  <  3tt.  Points 
on  this  sheet  (for  increasing  (t>)  have  27r  more  phase  than  equivalent  points  on  the  first  sheet.  On  this  sheet, 
the  state  whose  phase  has  recently  become  greater  than  tt  now  has  a  phase  co-ordinate  ~  —  tt  but  has  an 
additional  phase  of  27r  by  virtue  of  having  passed  onto  this  new  sheet.  Thus,  in  going  from  |e)  to  |o+),  the 
phase  difference  has  increased  by  7r  (shown  as  ±7r/2  in  Fig.  3).  Just  as  it  passes  over  |o),  the  co-ordinate 
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2> 


Fig.  3:  Map  of  states  as  phasors  using  normal  basis.  Note  apparent  discontinuity  associated  with  state  |o). 
Inset:  longitude  lines  of  equal  phase  difference. 


Fig.  4:  Phase  co-ordinate  plotted  vs.  actual  phase.  Since  the  co-ordinate  for  the  state  is  limited  to  —  n  <  0  < 
ir,  phases  must  be  ‘reset’  every  2ir  radians.  The  actual  phase  is  continuous  (dotted  line)  but  its  co-ordinate 
representation  is  not  (heavy  line  segments). 


changes  from  +7r  to  —  n  while  the  phase  has  simultaneously  jumped  by  2n  from  passing  onto  the  new  sheet. 
Thus,  the  phase  is  continuous,  while  the  co-ordinates  change  discontinuously  across  the  branch  cut.  This 
sheet  is  illustrated  in  Fig.  5  as  being  outside  the  original  sheet  for  increasing  0.  In  a  similar  way,  if  we  view 
states  with  decreasing  phase,  the  state  passes  onto  another  sheet  (shown  as  inside  the  original  sheet  in  Fig. 
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5)  as  it  crosses  the  branch  cut,  but  this  sheet  is  displaced  by  —  2n. 


Fig.  5:  Sketch  of  sheets  folded  at  branch  cut.  As  phase  increases  past  4>  =  n,  the  state  moves  to  another 
sheet  (green  sheet,  partially  shown,  outside  first  sheet)  as  it  passes  over  branch  cut.  Conversely,  for  phases 
decreasing  below  —  tv,  the  state  moves  to  a  different  sheet  (red  sheet,  partially  shown,  inside  first  sheet)  as  it 
passes  over  the  branch  cut.  Each  set  of  co-ordinates  implicitly  describes  points  on  other  sheets. 


What  is  clear  from  this  rather  technical  discussion  is  that  this  is  a  “co-ordinate  problem”  not  a  “state 
problem:”  nothing  discontinuous  happens  to  the  state,  the  apparent  discontinuity  is  an  artifact  of  the  map’s 
cyclical  re-use  of  a  fixed  range  of  longitude  co-ordinates.  This  is  somewhat  analogous  to  the  situation  in 
solid  state  physics  in  which  the  wavevector  of  an  accelerating  Bloch  electron  is  “umklapped”  back  to  the 
other  side  of  the  Brillouin  zone  to  keep  its  wavevector  in  the  first  Brillouin  zone.  In  our  case,  the  phase 
difference  is  increasing  monotonically  (we  imagine  a  counter  that  keeps  track  of  the  times  the  state  has 
wound  around  the  branch  cut)  but  the  representation  looks  non-mono  tonic. 

The  branch  points  at  which  the  branch  cut  (0  =  7 r)  ends  arc  the  two  basis  states,  |1)  and  |2).  One  can 
visualize  the  sheets  being  “folded  in”  at  these  singular  points,  as  shown  in  Fig.  5.  The  singularity  at  the 
branch  points  can  be  seen  in  light  of  the  normal  form,  Eqn  (10),  and  Figs.  3  and  5.  That  is,  all  states  on  lines 
of  a  given  longitude  have  the  same  phase  difference  between  their  components,  regardless  of  their  relative 
amplitudes  (i.e.  latitude).  At  the  poles,  however,  only  one  component  exists,  so  the  phase  difference  is 
meaningless.  Thus,  if  we  imagine  traversing  a  small  circle  near  the  north  pole,  the  phases  on  that  circle  go 
through  a  range  of  2-7r,  but  as  the  circle  (with  a  continuum  of  longitudes)  shrinks  to  the  point  of  the  pole, 
there  is  only  one  state.  Again,  the  states  are  continuous  but  the  co-ordinates  are  discontinuous. 

As  a  practical  matter,  the  discontinuities  due  to  the  branches  and  branch  points  are  inconsequential.  The 
issue  is  really  only  a  co-ordinate  problem,  not  a  mapping  problem,  and  any  trajectory  of  a  state  will  evolve 
smoothly  and  continuously:  in  terms  of  the  co-ordinate  mapping  to  a  “covering”  range  of  27r,  however,  it 
may  look  discontinuous. 


Coupled  Mode  Theory 


11 


V  STATE  EVOLUTION  IN  UNCOUPLED  GUIDES 

The  simplest  case  of  the  general  coupled  mode  equation  is  for  two  uncoupled  waveguides:  since  there 
is  no  coupling  between  them,  their  fields  propagate  independently. 


Identical  waveguides 


When  two  uncoupled  waveguides  arc  identical,  their  propagation  constants  arc  identical,  and  by  virtue 
of  Eqn  (5),  we  have  k\  =  &2  =  k  and  />•;;  =  0.  Since  there  is  no  coupling,  k\  =  K2  =  0  by  virtue  of  Eqn  (4). 
From  the  basic  coupled  mode  equation  we  then  have 


A 

dz 


at 

0 

0  ' 

at 

at 

CL2  _ 

=  -3 

0 

0 

“2  _ 

a2 

constant. 


If  the  2D  state  vector  remains  constant,  then  the  normal  form  implies  that  the  3D  representation  also  remains 
constant,  so  the  state  vector  shown  in  Fig.  2  remains  fixed.  This  is  as  expected:  since  the  amplitudes  in 
each  waveguide  remain  unchanged,  9  remains  unchanged;  since  the  waves  suffer  identical  phase  shifts  as  z 
progresses,  there  is  no  change  in  their  phase  difference  and  0  remains  constant.  What  has  been  omitted,  of 
course,  is  that  in  propagating  down  the  waveguides  there  is  a  continual  advance  of  phase,  which  we  would 
expect  to  describe  with  the  slowly-varying  gauge  transformation,  Eqn  (7). 


Distinct  waveguides  -  normal  basis 


A  more  interesting  case  is  for  uncoupled  waveguides  with  different  propagation  constants.  Then  = 
K2  =  0,  since  they  are  uncoupled,  but  k\  /  k‘2  implies  that  K3  /  0.  In  this  case,  Eqn  (9),  the  coupling 
equation,  becomes 


d  a\ 
dz  <22 


r  +K3  0 
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-K  3  02 

(11) 


The  solution  is  immediate:  a  1.2  ~  e^JK:iZ  with  the  implication  that  the  phase  of  wave  1  (assumed  to  be  the 
“slow”  mode)  advances  with  respect  to  that  of  waveguide  2.  Indeed, 


ai(z) 

e~jK3Z  0 

'  01 (0)  ' 

'  ai(0)e-^3*  ' 

(cos^-e  ^^2)e  i  23 

02(2)  _ 

0  e+iK3Z 

.  oa(O)  _ 

a2(0)e+iK3Z  _ 

sin^e+^i^2)e+^^~ 

Cast  in  this  form,  Eqn  (12)  permits  direct  comparison  to  Eqn  (10):  6  is  fixed  at  its  initial  value,  9i,  since 
the  amplitudes  remain  constant,  while  <j)  increases  linearly  with  distance,  namely  A0  =  2k:>,z.  That  is, 
the  slow  guide  is  monotonically  accumulating  phase  with  respect  to  the  fast  guide  at  a  rate  which  is  the 
difference  between  the  propagation  constants  (Eqn  (5)).  Note  that  in  this  case,  too,  angles  in  the  geometric 
representation  seem  doubled  compared  to  the  matrix  representation  since  it  is  cast  in  terms  of  difference 
between  the  mean  and  the  phases  of  the  two  waveguides.  The  motion  of  the  state  vector  as  the  waves 
progress  down  the  two  guides  is  then  a  precession  about  the  §3  axis,  as  shown  in  Fig.  6. 

This  situation  is  described  as  “uncoupled,  asynchronous”  propagation:  the  two  waveguide  modes  do  not 
exchange  power  (uncoupled,  since  6  is  constant)  but  there  is  a  continual  phase  slippage  between  them  {<f> 
increases  monotonically):  their  phases  are  asynchronous. 


Distinct  waveguides  -  alternate  basis 

As  mentioned  above,  the  geometric  view  permits  easy  visualizations  for  a  change  of  basis.  In  particular, 
we  show  here  a  change  of  basis  to  the  even  and  odd  states,  a  basis  that  will  later  be  seen  as  important. 
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Fig.  6:  Uncoupled,  asynchronous  evolution.  The  relative  amplitudes  remain  the  same  {6  is  constant),  but 
the  relative  phase  difference  increases  monotonically. 


Alternate  basis 

Now  consider  the  system  viewed  in  a  different  basis,  which  we  will  denote  by  stars  as  |1*)  and  |2*). 
We  will  take  these  new  basis  vectors  to  be  the  states  |e)  and  |o),  respectively.  While  we  could  re-draw  the 
map  with  these  states  as  the  north  and  south  poles,  the  most  useful  view  is  generally  to  view  the  states  as 
primary  and  re-draw  the  co-ordinate  axes.  Thus  the  new  primary  axis  runs  between  |1*)  and  |2*),  and  the 
new  co-ordinates  are  referred  to  this  axis.  From  the  definitions  of  the  cardinal  points  in  normal  form,  we 
can  express  each  of  them  as  superpositions  of  their  new  basis  states,  and  hence  in  terms  of  the  original  basis 
states.  We  find  that 

|1*>  =  le)  |2*)  =  |o) 


^(|1*>  +  |2*» 


7i(|e)  +  |o>) 


1+) 


1+*)  =  -L(e-^/4  |U)  +  e+W4  |2*))  =  e-W4  |1) 
V2 

|o*)  =  -^=(e~jn/2  |1*>  +  e+j7r/2  |2*))  =  e ~jw/2  |-) 

-*)  =  -^(e+j7r/4  |1*)  +  e”^/4  |2*))  =  e+^Z4  |1) . 

v2 


(13) 


These  relationships  are  plotted  in  Fig.  7  below.  Thus,  for  instance,  |e*)  in  the  second  line  above  is  defined 
in  the  left  equality  by  appealing  to  the  normal  form,  the  basis  definitions  are  used  in  the  second  equality, 
then  the  geometric  sums  are  constructed  and  compared  to  Fig  3  for  the  last  equality.  As  the  inset  shows,  the 
lines  of  constant  phase  difference  are  now  oriented  differently  than  for  the  normal  basis.  Indeed,  viewing 
the  phase  of  the  |1*)  component  (the  leftmost  component  in  the  graphical  sums  in  Fig.  7),  one  sees  that  in 
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going  from  state  |— *)  — »  |e *}  — y  |+*)  — y  |o*},  the  phase  of  the  |1*)  component  changes  monotonically 
as  it  rotates  CW  in  the  Re  —  Im  frame.  The  figure  shows  the  superposition  of  the  two  basis  states  and 


Fig.  7:  Map  of  states  as  phasors  using  even-odd  basis.  Inset:  lines  of  constant  phase  difference  between 
basis  states. 

their  resultant  sums.  When  the  Figs  3  and  7  are  placed  side-by-side,  one  sees  that  there  are  additional 
phases  involved.  That  is,  for  instance,  the  state  |— *)  is  not  exactly  the  same  as  state  1 2) ,  but  has  an  extra 
phase  shift  of  7t/4.  This  is  a  manifestation  of  the  fact  that  the  phase  differences  seen  in  the  insets  have 
fundamentally  different  orientations.  Such  extraneous  phase  relationships  often  occur  in  polarization  optics 
when  transforming  from  one  basis  to  another,  and  are  related  to  the  unitary  transformations  that  create  the 
co-ordinate  transformations:  they  are  ultimately  rotations  and  rotations  do  not  commute. 


Uncoupled  waveguides  in  alternate  basis 

Returning  to  the  “uncoupled,  asynchronous”  state  evolution  described  in  Fig.  6,  we  now  address  the 
evolution  of  the  system  in  our  new  basis  set.  That  is,  we  want  the  same  trajectory  of  states  in  Fig.  6  to  be 
described  in  our  new  co-ordinate  system.  In  the  new  system,  the  co-ordinates  are  constructed  by  projecting 
states  onto  the  |1*)  and  |2*)  basis  set,  finding  the  angles,  and  using  the  normal  form,  Eqn  (10).  The  projection 
onto  |1*)  is  given  by  half  the  angle  of  9e,  the  angle  to  1  * ) .  Our  discussion  above  shows  that  the  azimuth 
in  this  basis  is  measured  from  |e*),  which  provides  4>e  for  the  normal  form.  In  this  basis,  note  that  now  the 
modes  are  viewed  as  “coupled”  since  there  is  power  exchanged  between  them.  That  is,  angle  9e  varies  as 
the  state  cycles  around  in  Fig.  9,  and  thus  the  amplitudes  in  the  two  modes  vary  by  virtue  of  Eqn.  (10).  The 
relative  phase,  however,  is  bounded  as  the  state  follows  its  trajectory.  (There  is  more  to  it,  but  it  requires 
introducing  “fiducial  paddles”  [2,  3].)  Thus,  in  this  basis,  the  system  is  said  to  be  displaying  “coupled, 
synchronous”  behavior.  In  other  words,  what  looks  like  uncoupled  “beating”  of  independent  modes  in  one 
basis  set,  can  be  viewed  as  a  synchronous  power  transfer  between  modes  in  another  basis  set. 
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Fig.  8:  Comparison  of  states  for  two  basis  sets. 


Fig.  9:  ’’Uncoupled,  asynchronous”  evolution  in  Fig.  6,  as  viewed  in  another  basis.  In  this  basis,  power 
varies  as  the  state  evolves. 

VI  STATE  EVOLUTION  IN  COUPLED  GUIDES:  THE  DIRECTIONAL  COUPLER 
Form  of  vector  evolution 

The  simple  case  above  encapsulates  a  great  deal  of  the  coupled  mode  formalism.  The  key  point  is  that  the 
coupled  mode  formalism  is  generally  described  by  a  rotation  of  the  state  vector  about  some  axis:  that  axis, 
and  the  angular  velocity  of  the  rotation  about  it,  are  determined  by  the  matrix  in  the  basic  equation,  Eqn  (9). 
Regardless  of  basis,  the  state  evolves  on  some  trajectory,  and  the  overall  state  can  viewed  in  different  bases 
with  different  physical  pictures.  That  is,  the  same  underlying  phenomena  can  be  viewed  in  different  ways, 
depending  on  the  basis  employed,  and  we  are  free  to  choose  any  basis  which  is  convenient.  This  makes  the 
study  of  such  systems,  whether  coupled  waveguides,  polarization  optics,  or  atomic  systems,  much  richer 
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and  more  susceptible  to  physical  insight  and  intuition  [8] . 

What  makes  such  physical  systems  yield  to  detailed  geometrical  analysis  is  the  ability  to  express  the 
behavior  in  the  language  of  linear  algebra  and  vectors,  to  which  we  now  turn.  This  development  is  famil¬ 
ial-  from  classical  optics  as  the  Jones  calculus,  Poincare  sphere,  Stokes,  and  Mueller  calculus  [4,  9].  The 
connection  between  the  matrix  view  (the  “Jones”  view  in  polarization  optics)  and  the  geometric  view  (the 
“Stokes/  Poincare”  view  in  polarization  optics)  is  founded  in  the  representation  theory  of  matrices.  The  2x2 
complex  matrices  are  spanned  by  the  identity  matrix  and  the  three  Pauli  matrices 


'  1 

0  ' 

'  0 

1  ' 

0  -  j  ' 

'10 

<70  = 

0 

1 

;  0-1  = 

1 

0 

;  02  = 

_+j  0 

;  0-3  = 

0  -  ! 

Thus,  any  matrix  K  can  be  expressed  as  K  =  K,at  for  four  complex  numbers  k,  (implied  summation).  If 
K  is  Hermitian  and  traceless,  then  the  n,  are  three  real  numbers  for  i  =  1,  2, 3,  which  can  be  viewed  as  a 
vector  triplet  («i,  k2 ,  )  =  k.  Viewing  Eqn  (11),  one  sees  that  it  is  of  form 


d  a\ 
dz  a  2 


+«3 

0 

0 

-«3  . 

-JK  3 


+1  0 

0  -1 


a  i 
«2 


-jn  3  <73 


at 

02 


(15) 


In  this  case,  the  coupling  matrix  is  characterized  by  only  the  third  component,  so  we  can  associate  the  vector 
k  =  K:je:j.  Notice  that  the  vector  for  |v)  then  rotates  about  k  at  a  rate  of  2k:>„  by  virtue  of  Eqn  (12).  As 
we  will  show  below,  this  result  is  completely  general  as  long  as  K  is  Hermitian  and  traceless.  For  such 
conservative  systems,  the  coupled  mode  equation  (Eqn  (9))  can  be  expressed  as 


ai 

«3 

Ki  -  jn2 

at 

CL2  _ 

=  ~j 

«1  +jK  2 

-«  3 

°2 

a\ 

ai 

a2  _ 

=  -jZ- a 

fl2  _ 

(16) 


and  when  it  is,  it  means  that  the  geometric  representation  for  state 


a  i 
«2 


will  rotate  about  the  vector  k  at  a 


rate  of  2  |E|.  Thus,  Eqn  (16)  shows  that  the  vector  k  is  the  geometrical  representation  of  the  matrix  operator 
in  Eqn  (9). 


Matrix  analysis 

Differential  equation  for  state 

Eqn  (16)  is  the  general  form  that  the  coupling  can  take:  K3  accounts  for  “detuning,”  or  mismatches  in  the 
waveguide  propagation  constants,  while  k\  and  k2  describe  the  coupling  between  the  waveguides.  Direct 
solutions  of  Eqn  (16)  have  been  tabulated  [10]  for  conservative  systems  such  as  these.  Here,  we  approach 
the  problem  in  a  way  that  highlights  the  application  to  a  geometrical  representation.  Setting 


K 


k\  =  |«iei  +  K2e2  +  K3e3 


ni  +  +  k 


2 

3’ 


there  must  be  (see  Fig.  10)  angles  Oq  and  q>o  such  that  magnitude  n  can  be  factored  out  of  the  coupling 
equation,  Eqn  (16): 


at 

K  3 

K\  ~  jK  2 

ai 

a2  _ 

=  -3 

K\  +jK  2 

~«3 

a2  _ 

d 

a\ 

COS  80 
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dz 
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=  -JK 

sin  80  e+j(t>0 

—  COS  00 

-JkD 


a  1 
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(17) 
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Fig.  10:  Vector  k  represents  three  components  (nt)  in  the  coupling  matrix.  This  vector  can  be  chararacter- 
ized  by  its  length  |k|  and  two  angles,  6q  and  do. 

The  matrix  D,  defined  in  this  equation,  is  the  generator  for  this  system: 

r  cos  0O  sin#0  1 
sin  e+^°  —  cos  9q 


Eigenbasis  and  similarity  transforms 


The  eigenvectors  for  D  are  given  by 


cos  ^  e  ifio/2 
sin  f  e+^°/2 


I/)  ~  /~ 


sin  ^  e~^°/2 
—  cos  ^  e+^°/2 


(19) 


where  we  have  labelled  the  states  to  conform  to  our  earlier  descriptions  as  “slow”  and  “fast”  [2,  3],  and 
where  the  eigenvalues  are  ±1.  Clearly,  s  is  in  the  normal  form  for  a  vector  such  as  in  Fig.  2.  In  fact,  s  is 
explicitly  k,  the  unit  vector  along  the  direction  of  k.  We  will  see  that  the  differential  equation  for  the  state 
evolution  will  be  a  rotation  about  this  unit  vector,  as  noted  above. 

It  can  be  shown  [2]  that  /  can  also  be  cast  as  a  state  in  normal  form  with  9q  >  n  —  do  and  do  — >■  d>o  —  tt 
and  multiplied  by  a  constant  phase  factor  e~ ■7'7r/2.  This  is  the  state  on  the  opposite  side  of  s  in  the  geometrical 
representation  [2]  and  the  phase  factor  indicates  that  the  “fiducial  paddle”  for  this  state  [2,  3]  is  rotated  by  7 r 
radians. 

Creating  a  matrix  from  the  adjoints  of  s  and  /,  we  can  create  a  matrix  S~''  that  transforms  co-ordinates 
in  our  conventional  basis  to  the  basis  defined  by  eigenstates  s  and  /,  and  a  matrix  S  that  does  the  reverse. 
That  is,  we  can  view  matrix  S t  as  being  formed  with  the  top  row  set  as  (s|  ~  V’  and  bottom  row  set  as 
(/I  rs_/  ff  When  this  matrix  operates  on  any  2D  vector,  it  dots  that  vector  into  (s|  and  enters  this  as  the 
top  component  of  the  new  vector  and  similarly  for  (/|  as  the  bottom  component.  That  is,  it  transforms  the 
co-ordinates  of  the  vector  in  the  original  basis  to  the  co-ordinates  of  the  same  vector  in  the  s  —  f  basis,  i.e. 
it  takes  |s)  =>-  |1).  In  a  similar  way,  the  matrix  S  comprises  the  column  vectors  s  and  /  and  creates  the 
inverse  operation,  taking  |1)  =>  |s).  Using  our  normal  forms, 
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'  cos  f  e+7^o/2 

sin  f  e~^°/2 

Q  — 

'  cos  I  e-^o/2 

sin  f  e“J'*>/2 

sin  f  e+ofo/2 

-  cos  f 

o  — 

_  sin  §  e+i0o/2 

-  cos  f  e+3<t>o/2  . 

5f  = 


This  permits  us  to  make  similarity  transforms  between  the  bases  for  operators: 


(20) 


D  = 


cos  9q  sin  #o  e  ^ 0 
sin  Qq  e+^°  —  cos  9q 

D  =  SD0S] 


Do  = 


1  0 
0  -  1 


Do  =  S^DS 


(21) 


In  words,  this  says  that  D,  operating  on  a  vector  in  the  conventional  basis  set,  is  equivalent  to  the  operator 
D$  operating  in  the  eigenvector  basis  set.  That  is,  if  we  are  in  the  conventional  basis  we  must  first  transform 
the  vector’s  co-ordinates  to  the  eigenbasis  with  S',  then  operate  on  the  result  with  Do,  the  operator’s  rep¬ 
resentation  in  the  eigenbasis,  and  finally  use  S  to  transform  the  resultant  vector  back  to  co-ordinates  in  the 
conventional  basis.  The  reason  we  do  this  is  to  enable  a  simple  integration  of  the  coupled  mode  differential 
equation,  Eqn  (9). 

Solution  of  the  coupled  mode  equation 

We  can  formally  integrate  the  coupled  mode  equation,  Eqn  (17),  as 
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dz 
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Oi  (z) 
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(22) 


as  can  be  seen  by  substituting  into  the  differential  equation.  We  assume  that  the  coupling  does  not  vary 
along  the  length  of  the  coupling  region,  so  that 

—j  f  RD  dz  =  —j RDz. 
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Because  Do  is  diagonal,  Dq  is  also  diagonal:  D g  = 
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.  This  simplifies  the  expression  to 
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or 


ai  {z) 
CL2  (z) 


S 


e~iRz  0 
0  e+jRz 


S' 


ai  (0) 
a2  (0) 


(23) 


This  equation  exemplifies  the  spectral  theorem  [2,  11]  from  linear  algebra.  In  words,  it  says  that  once 
the  initial  2D  state  vector  has  been  transformed  to  the  eigenbasis  by  means  of  co-ordinate  transformation 
S',  the  top  and  bottom  components  are  the  amplitudes  of  that  state  vector  in  the  slow  and  fast  eigenstates, 
respectively.  These  amplitudes  are  not  mixed,  by  virtue  of  the  diagonal  matrix,  but  are  multiplied  eTJKZ  as 
they  propagate  a  distance  z  through  the  coupler.  At  the  end  of  the  propagation,  the  two  components  are  co¬ 
ordinate  transformed  back  to  the  original  basis  by  S,  to  give  the  final  output  vector.  Performing  the  matrix 
multiplication  indicated  in  Eqn  (23),  one  has  the  general  solution  for  the  ideal  directional  coupler: 


at  (z) 

0,2  (z) 

cos  kz  —  j  cos  #o  sin  kz 
_je+j<t>o  sin  Q0  sin  Rz 


—je  sin  do  sintcz 
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ai  (0) 
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(24) 


For  the  particular  case  of  an  ideal  3  dB  coupler,  we  would  have  kz  =  vr/4,  do  =  ir / 2,  and  <po  =  0  which, 
for  a  coupler  of  length  z  =  L,  reduces  Eqn  (24)  to 


at  (L) 
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1 

-j 

'  01  (0)  ' 
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Geometrical  representation 

General  form 


It  can  be  shown  [1]  that  our  map  for  the  vectors  is  equivalent  to  the  statement  that 


«  =  H  Is) 


(26) 


and  that  the  equation  of  motion  is 

ds  _ ,  _ , 

—  =  2k  x  s,  (27) 

dz 

namely  a  precession  of  s  about  the  vector  k  at  a  rate  of  2k  rad/m.2  This  can  be  proven  by  differentiating 
Eqn  (26),  using  Eqn  (16),  the  equation  of  motion,  as  well  as  its  adjoint,  and  applying  the  commutator 
relations  for  the  nr  [1,  2,  3,  12].  Here,  we  approach  the  problem  from  the  geometrical  direction. 

Our  derivation  of  the  general  coupler  equation,  Eqn  (24),  by  way  of  Eqn  (23),  shows  that,  by  construc¬ 
tion,  its  eigenvalues  are  the  diagonal  elements  e^KZ,  and  the  eigenvectors  are  |s)  and  |/),  as  given  in  Eqn 
(19).  The  matrix  in  brackets  inside  Eqn  (23)  is  familial-  from  the  discussion  surrounding  Eqn  (12).  There, 
we  saw  that  such  a  diagonal  form  for  the  matrix  was  equivalent  to  rotation  about  the  63  axis.  In  Eqn  (23), 
however,  we  have  a  similarity  transform:  the  diagonal  matrix  is  sandwiched  between  S  and  S'.  This  means 
that  the  operator  is  not  rotating  about  the  63  axis,  but  is  performing  the  “similar”  operation  in  the  new 
co-ordinate  system  defined  by  the  similarity  transform.  In  other  words,  the  operator 


-3  HZ  Q 

0  e+jRz 


S' 


(28) 


performs,  in  the  new  basis,  the  operation  “similar  to”  the  operation  performed  by 


e~jRz  0 

0  e+3HZ 


(29) 


2In  the  telecommunications  literature,  it  is  customary  to  suppress  the  factor  of  2  in  this  equation  by  dividing  terms  in  the  2x2 
matrices  by  2  [2,  3,  12], 
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in  the  original  basis.  That  is,  in  the  co-ordinate  system  defined  by  S',  namely  |s)  — y  |1)  ~  §3,  the  operator 
in  Eqn  (28)  creates  a  rotation  about  the  vector  s  with  angular  velocity  2 k,  just  as  the  operator  in  Eqn  (29) 
does  about  the  §3.  Thus,  Eqns  (23,24)  in  the  2D  space  describe,  in  the  geometric  representation,  a  uniform 
rotation  (i.e.  at  constant  rate  as  z  increases)  about  the  unit  vector  corresponding  to  the  eigenstate  of  Eqn 
(16),  namely  k.  Indeed,  this  is  the  same  form  we  have  discussed  in  the  context  of  polarization  optics  in 


Fig.  11:  Coupler  evolution:  general  case.  Initial  state  | r/y)  evolves  into  state  jC/)  under  the  influence  of 
coupler  operator  k. 

earlier  work  [2,  3]. 

The  general  geometric  solution  of  the  coupler  equation  is  shown  in  Fig.  1 1 .  The  components  of  k,  found 
either  by  the  components  in  Eqn  (9)  or  by  knowledge  of  the  eigenstate  angles  do  and  do  in  Eqns  (20,  21), 
create  the  vector  about  which  a  typical  vector  s  (representing  a  normalized  input  state  to  the  coupler,  in  our 
normal  form)  rotates.  The  magnitude  of  the  rotation  (i.e.  the  angle  about  it  through  which  the  trajectory 
from  I'l/ji)  to  | ipf)  passes)  is  2Rz.  The  rotation  does  not  change  the  amplitude  of  s,  in  the  same  way  that  Eqn 
(24),  a  unitary  matrix,  doesn’t  change  the  magnitude  of  the  2D  vector.  A  more  general  form,  including  an 
overall  phase  constant,  was  treated  earlier  [2,  3]  to  be  able  to  incorporate  the  overall  propagation  constant 
and  topological  phase,  which  we  have  suppressed  in  this  report.  That  approach  entails  “fiducial  paddles” 
which  track  the  overall  phase  of  the  state,  necessary  when  dealing  with  interferometric  problems.  It  includes 
a  more  graceful  way  of  handling  the  branch  cut  discussed  above. 


Features  of  the  geometric  representation 

Several  features  of  coupling  which  are  not  immediately  clear  in  the  2D  representation  cam  be  gleaned 
from  viewing  Fig.  11. 

First,  recall  that  the  basis  set  we  are  using,  ±£3,  are  the  ports  or  waveguides  for  the  coupler.  If  one  of 
them  is  “slow”  we  conventionally  define  it  as  +§3.  Then,  by  virtue  of  normal  form  Eqn  (10),  the  input  to 
the  coupler  is  defined  by  both  the  relative  amplitudes  between  the  two  guides  and  the  phase  between  those 
two  optical  components.  For  light  introduced  into  one  guide  only,  | ipi)  is  ±§3. 
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Fig.  12:  Coupler  evolution:  ideal  3  dB  coupler.  Initial  state  | i//)  =  |1)  evolves  into  final  state  \ipf).  Vector 
for  coupler  operator  is  along  ei  axis,  and  creates  a  rotation  of  2 kz  =  n/2. 


Second,  in  the  geometric  approach,  there  is  no  clear  distinction  between  “coupling  strength”  as  defined 
by  k\,  k/2  (Eqn  (9)’s  “off-diagonal”  components)  and  “detuning”  as  defined  by  k-j,  a  measure  of  the  differ¬ 
ence  between  the  guides’  propagation  constants  in  Eqn  (5).  That  is,  three  k,  are  components  of  the  single 
vector  k. 

This  gives  rise  to  the  third  point:  there  is  a  definite  interplay  between  the  coupling  strength  and  the 
detuning.  Namely,  if  the  detuning  k->  is  much  greater  than  the  coupling  strength’s  magnitude  \J  n\  +  k%, 
then  it  is  impossible  to  attain  3  dB  coupling.  Indeed,  if  9q  <  7r/4,  the  circle  about  k  on  which  e:;  resides 
can  not  reach  the  ei  —  £2  plane.  Since  points  on  this  plane  are  at  angle  6  =  7r/2,  it  is  these  points  (more 
accurately,  points  on  the  “equator”  of  Fig.  11)  that  have  equal  powers  in  the  two  guides.  This  fact  is  more 
difficult  to  tease  out  of  the  2D  form  Eqn  (24)  than  the  geometric  form.  It  is  apparent  more  detuning  must  be 
compensated  by  more  coupling  to  retain  even  power  splitting.  In  fact,  one  can  see  that  a  more  robust  3  dB 
coupler  design  might  be  to  intentionally  detune  the  coupler  so  that  the  arc  finishes  tangentially  to  the  ei  —  §2 
plane. 

Fourth,  one  sees  that  complete  transfer  of  power  is  only  possible  if  (i)  K3  =  0,  i.e.  the  coupler  has  no 
detuning,  and  (ii)  2 kz  =  n/2.  Again,  this  is  obvious  from  the  geometric  picture,  but  requires  more  digging 
in  the  2D  representation. 

Finally,  Fig.  12  shows  the  typical  situation  for  an  ideal  3  dB  coupler  used  as  a  power  splitter.  The  basis 
states  are  the  two  uncoupled  guide  states,  |1)  and  1 2).  In  the  ideal  case,  the  guides  are  identical,  so  k;S  =  0: 
thus  k  lies  in  the  ei  —  §2  plane.  Specifically,  in  the  ideal  case,  only  k\  /  0,  so  k  lies  along  the  ei  direction. 
When  light  is  introduced  into  one  of  the  guides,  say  1 1) ,  it  evolves  as  it  propagates  through  the  coupler, 
rotating  about  k.  To  evenly  split  the  light  between  the  two  guides,  rotation  2 kz  for  the  coupler’s  length  2: 
is  7t/2:  at  this  point,  | ijjf),  the  final  state  is  7r/2  from  both  |1)  and  |2),  so  the  amplitude  in  each  guide  is 
cos  ^  .  There  is  a  differential  phase  shift  of  n/2  that  is  acquired  in  traversing  the  coupler:  this  is 

evident  from  looking  at  Fig  8  and  from  Eqn  (24).  If  this  coupler  were  part  of  an  interferometer,  we  would 
have  to  add  the  phases  we  suppressed  in  the  gauge  transformation,  as  well.  Earlier  maps  (Fig  3)  show  how 
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the  relative  phases  arise  in  the  various  output  states. 

Connection  to  conventional  eigenstate  analysis 

For  completeness,  we  include  a  view  of  coupled  waves  by  analyzing  it  using  the  conventional  eigensys- 
tem  method.  The  general  form  for  the  coupled  waves,  Eqn  (9),  must  have  the  form 

d  \  ai  1  .  [  k3  ki  -  jn2  1  f  at  1  [  ai  1 

—  =  —jk  (30) 

dz  L  «2  J  L  +  7^2  —  «3  J  L  a2  J  L  «2 

for  the  eigenvalues  k:  the  left  half  describes  the  equation  of  motion,  and  the  right  half  describes  the  oscil¬ 
latory  form  necessary  for  normal  modes.  When  the  right  half  of  this  eigenvalue  equation  is  solved,  we  find 
the  eigenvalues 

k  =  ±K  =  ±  \J ti\  +  K 2  + 

as  we  would  expect  from  the  discussion  above.  Recalling  Eqn  (7),  the  slowly  varying  transformation,  Eqn 
(30)  implies  that  there  are  two  allowed  solutions  for  the  propagation  constant,  k  ±  n,  for  a  field  introduced 
at  frequency  c o.  This  is  conventionally  displayed  on  a  dispersion  curve  such  as  Fig  13. 

The  basic  features  of  the  dispersion  curve  are  shown  in  the  inset,  which  is  most  easily  interpreted  by 
considering  the  wavenumbers  k,  on  the  horizontal  axis,  to  be  dependent  on  the  optical  frequency  u,  on  the 
vertical  axis.  We  have  been  discussing  the  case  for  a  single  optical  frequency,  represented  by  a  horizontal  line 
on  this  plot.  The  dashed  lines  represent  the  uncoupled  wavenumbers  for  the  two  waveguides  as  a  function  of 
u,  shown  as  distinct  in  this  figure,  for  generality.  At  the  “phase -matched”  frequency,  ccq.  the  two  uncoupled 
guides  have  the  same  wavenumber,  k o-  Since  the  phase  velocity  is  vp  =  c o/k,  the  uncoupled  guides  have 
the  same  phase  velocity  at  loq,  but  it  is  clear  that  they  have  different  group  velocities,  vg  =  du/dk,  since 
the  dashed  lines  have  different  slopes.  The  solutions  to  the  eigenvalue  equation  are  k±  =  k  ±  R,  and  are 
shown  as  the  solid  lines.  It  is  clear  that  k  is  always  less  than  the  minimum  of  k\ .  h>,  while  k+  is  greater. 
Thus,  k+,  for  instance,  is  the  “slower”  of  the  modes,  and  the  k±  approach  the  uncoupled  values  for  large 
values  of  detuning  u  —  ujq.  (In  this  simplification  of  a  dispersion  relation,  the  two  modes  have  constant 
group  velocities,  but  their  phase  velocities  change  with  frequency.) 

A  deeper  view  of  the  dispersion  curve  is  shown  in  the  larger  detail  of  Fig  13,  a  blow-up  of  the  region 
around  the  phase-matched  point.  For  generality,  we  choose  a  non-zero  detuning:  the  horizontal  line  is  for 
u  >  ccq.  The  four  dots  are  the  intersection  of  u  and  the  dispersion  curves,  with  the  abscissae  corresponding 
to  the  coupled  ( k-,k+ )  and  uncoupled  (k\ ,  k^)  wavenumbers.  It  is  clear  that  the  coupled  wavenumbers 
are  farther  from  the  mean  k  than  the  uncoupled  wavenumbers,  a  general  “splitting”  feature  of  perturbation 
analyses. 

Point  a)  on  the  diagram  shows  the  splitting  of  the  wavenumbers  for  the  case  of  zero  detuning.  In  this 
view,  the  magnitude  of  the  splitting  is  the  coupling  strength,  ^ n\  +  it  corresponds  to  the  largest  devia¬ 
tion  k±  has  from  uncoupled  k's.  For  the  more  general  case  of  non-zero  detuning,  the  uncoupled  wavenum¬ 
bers  separate  because  the  slopes  of  the  dashed  curves  correspond  to  different  vg  =  doo/dk.  The  amount  of 
the  splitting  is  given  by  Eqn  (5)  and  has  the  geometric  representation  shown  at  point  b)  in  the  figure:  since 

_  di o  uj  —  ujq 

9 1  dk  ki  —  k0  ’ 

the  uncoupled  mode  splitting  is  related  to  the  detuning  and  group  velocities  by 

2av3  =  (w  -  woXv1  “  V1)- 
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Fig.  13:  Coupler  dispersion  curve.  Inset:  Plot  of  uncoupled  guide  dispersion  curves(dashed)  distorting  in  the 
presence  of  waveguide  coupling  (solid).  Detail:  The  deviation  of  the  coupled  curves  is  the  rss  deviation  in  k 
due  to  detuning  of  the  uncoupled  modes  (k;j,  which  increases  linearly  with  Aw),  and  the  idealized  “coupling 
strength”  \J k\  +  k|  at  zero  detuning.  The  rss  deviation  of  these  two  contributions  is  the  deviation  of  the 
coupled  propagation  constants  from  the  average  propagation  constant,  k. 


Then  the  deviation  of  the  coupled  wavenumbers  from  the  mean  k  is  the  root-summed-square  (rss)  of  the 
coupling  and  detuning  deviations,  namely  k  =  \/ as  shown  at  point  c)  in  the  figure.  This 
shows  why  the  deviation  is  the  coupling  strength  for  zero  detuning,  while  for  large  detuning  it  tends  to  the 
detuning  itself. 

In  contrast,  one  sees  in  the  geometric  view  of  Fig  11a  visual  way  of  describing  the  same  facts.  In  this 
view 


•  The  splitting  is  due  to  the  length  of  the  vector  k,  since  one  could  transform  to  that  basis  set. 

•  Magnitude  | k\  is  determined  by  the  detuning  (component  in  the  e:>  direction)  and  the  coupling  strength 
(components  in  the  §i  —  £2  plane) 

•  That  magnitude  is  found  by  the  Pythagorean  theorem  and  is  why  the  detuning  and  coupling  are  com¬ 
bined  as  a  root-summed-square 
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•  The  zero  detuning  limit  reflects  that  the  K3  component  is  zero,  so  the  magnitude  \k\  is  determined 
only  by  the  coupling  components 

•  The  large  detuning  limit  reflects  the  fact  that  the  \k\  — >  K3  i f  K3  >>  n  1 .  Ko . 

•  Not  only  is  there  a  description  of  the  eigenvalues,  the  coupler’s  component  evolution,  magnitude  and 
phase,  is  described  in  the  same  view. 

VII  SUMMARY 

In  this  report,  we  have  used  the  directional  coupler  as  a  model  for  coupled  mode  theory.  We  have 

•  shown  how  the  fast  phase  variations  in  the  mode  can  be  transformed  away  to  yield  the  standard  coupled 
mode  equations, 

•  used  a  phasor  approach  throughout  to  physically  motivate  the  coupling  processes, 

•  shown  how  the  phase  ambiguity  in  the  geometrical  approach  can  be  resolved, 

•  demonstrated  how  the  phases  are  related  in  co-ordinate  transformations, 

•  used  a  similarity  transform  analysis  to  provide  an  alternate  derivation  of  the  coupling  equation,  and 

•  established  connections  between  the  conventional  dispersion  maps  for  waveguides  and  the  geometri¬ 
cal  representation  of  coupled  waves. 
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